arXiv:1503.08804v3 [cs.DM] 23 Dec 2015 


RANDOM SAMPLING IN COMPUTATIONAL ALGEBRA: 
KELLY NUMBERS AND VIOLATOR SPACES 

JESUS A. DE LOERA, SONJA PETROVIC, AND DESPINA STASI 


Abstract. This paper transfers a randomized algorithm, originally used in geometric optimization, 
to computational problems in commutative algebra. We show that Clarkson’s sampling algorithm 
can be applied to two problems in computational algebra: solving large-scale polynomial systems and 
hnding small generating sets of graded ideals. The cornerstone of our work is showing that the theory 
of violator spaces of Gartner et al. applies to polynomial ideal problems. To show this, one utilizes 
a Helly-type result for algebraic varieties. The resulting algorithms have expected runtime linear in 
the number of input polynomials, making the ideas interesting for handling systems with very large 
numbers of polynomials, but whose rank in the vector space of polynomials is small (e.g., when the 
number of variables and degree is constant). 
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1. Introduction 

Many computer algebra systems offer excellent algorithms for manipulation of polynomials. But 
despite great success in the field, many algebraic problems have bad worst-case complexity. For 
example, Buchberger ’s [la [a [n] groundbreaking algorithm, key to symbolic computational alge¬ 
bra today, computes a Grobner basis of any ideal, but it has a worst-case runtime that is doubly 
exponential in the number of variables [22] • This presents the following problem: what should one do 
about computations whose input is a very large, overdetermined system of polynomials? In this paper, 
we propose to use randomized sampling algorithms to ease the computational cost in such cases. 

One can argue that much of the success in computation with polynomials (of non-trivial size) often 
relies heavily on finding specialized structures. Examples include Faugere’s et al. fast computation 
of Grobner bases of zero-dimensional ideals [251 |26l[271[29l[35|, specialized software for computing 
generating sets of toric ideals [1], several packages in [32] built specifically to handle monomial 
ideals, and the study of sparse systems of polynomials (i.e., systems with fixed support sets of 
monomials) and the associated homotopy methods [35|. A more recent example of the need to find 
good structures is in m, where Gifuentes and Parrilo began exploiting chordal graph structure in 
computational commutative algebra, and in particular, for solving polynomial systems. Our paper 
exploits combinatorial structure implicit in the input polynomials, but this time akin to Helly-type 
results from convex discrete geometry [36]. 

At the same time, significant improvements in efficiency have been obtained by algorithms that 
involve randomization, rather than deterministic ones (e.g. [mills]); it is also widely recognized 
that there exist hard problems for which pathological examples requiring exponential runtimes occur 
only rarely, implying an obvious advantage of considering average behavior analysis of many algo¬ 
rithms. For example, some forms of the simplex method for solving linear programming problems 
have worst-case complexity that is exponential, yet [44] has recently shown that in the smoothed 
analysis of algorithms sense, the simplex method is a rather robust and fast algorithm. Smoothed 
analysis combines the worst-case and average-case algorithmic analyses by measuring the expected 
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performance of algorithms under slight random perturbations of worst-case inputs. Of course, prob¬ 
abilistic analysis, and smoothed analysis in particular, has been used in computational algebraic 
geometry for some time now, see e.g., the elegant work in PEKE]. The aim of this paper is to 
import a randomized sampling framework from geometric optimization to applied computational 
algebra, and demonstrate its usefulness on two problems. 

Our contributions. We apply the theory of violator spaces m to polynomial ideals and adapt 
Clarkson’s sampling algorithms m to provide efficient randomized algorithms for the following 
concrete problems: 

(1) solving large (overdetermined) systems of multivariate polynomials equations, 

(2) finding small, possibly minimal, generating sets of homogeneous ideals. 

Our method is based on using the notion of a violator space. Violator spaces were introduced 
in 2008 by Gartner, Matousek, Riist, and Skovroh m in a different context. Our approach allows 
us to adapt Clarkson’s sampling techniques m for computation with polynomials. Clarkson-style 
algorithms rely on computing with small-size subsystems, embedded in an iterative biased sampling 
scheme. In the end, the local information is used to make a global decision about the entire system. 
The expected runtime is linear in the number of input elements, which is the number of polynomials 
in our case (see m for a more recent simplified version of Clarkson’s algorithm for violator spaces). 
Violator spaces naturally appear in problems that have a natural linearization and a sampling size 
given by a combinatorial Helly number of the problem. While violator spaces and Clarkson’s algo¬ 
rithm have already a huge range of applications, to our knowledge, this is the first time such sampling 
algorithms are being used in computational algebraic geometry. For an intuitive reformulation of 
Helly’s theorem for algebraic geometers, see Example 12.11 Main ingredients of violator spaces are 
illustrated through Examples 12.2113.21 and 13.61 A typical setup where problem (1) can be difficult 
and a randomized algorithm appropriate can be found in Example 14.81 

Before stating the main results, let us fix the notation used throughout the paper. We assume the 
reader is acquainted with the basics of computational algebraic geometry as in the award-winning 
undergraduate textbook |18] . Denote by iF a (algebraically closed) field; the reader may keep K = C 
in mind as a running example. Let /i = 0,..., /m = 0 be a system of m polynomials in n-|-1 variables 
with coefficients in K. We usually assume that m ^ n. As is customary in the algebra literature, 
we write fi,..., fm S 54 = K[xo ..., Xn\ and often denote the polynomial ring 93 by a shorthand 
notation K\x\. We will denote by (/i,..., fm) C 93 the ideal generated by these polynomials; that 
is, the set of all polynomial combinations of the ffs. Note that if E = {/i,... , fm} is a set of 
polynomials, the ideal (/i,..., fm) will equivalently be denoted by (F). 

A polynomial is said to be homogeneous if all of its terms are of same degree; an ideal generated 
by such polynomials is a homogeneous ideal. In this paper, the ideals we consider need not be 
homogeneous; if they are, that will be explicitly stated. In that case, the set of all homogeneous 
polynomials of total degree d will be denoted by [93]^. Finally, denote by V(5) the (affine) variety 
dehned by the set of polynomials S' G 93, that is, the Zariski closure of the set of common zeros of 
the polynomials in the system S. Therefore, the concrete problem (1) stated above simply asks for 
the explicit description of the variety (solution set) corresponding to an ideal (system of polynomial 
equations). The concrete problem (2) asks to find a smaller (e.g., minimal with respect to inclusion) 
set of polynomial equations that generate the same ideal - and thus have the exact same solution 
set. 

Solving large polynomial systems. Suppose we would like to solve a system of m polynomials 
in n -|- 1 variables over the field K, and suppose that m is large. We are interested in the coefficients 
of the polynomials as a way to linearize the system. To that end, recall first that the d-th Veronese 
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embedding of P"" is the following map : P"" —)• p( ^ 

(xo : • • • : Xn) eA {xq : Xq~^xi : • • • : x^). 

The map induces a coefficient-gathering map for homogeneous polynomials in fixed degree d: 

coeffrf : [lR]rf ^ 

Cq-i ,..., , 

where x“' corresponds to the z-th coordinate of the d-th Veronese embedding. We follow the usual 
notation |a| = cij. Therefore, if / is a homogeneous polynomial of deg(/) = d, coeff(/) is a vector 

in the K-vectoi space ^ ). This construction can be extended to non-homogeneous polynomials 
in the following natural way. Consider all distinct total degrees di,... ,ds of monomials that appear 
in a non-homogenous polynomial /. For each di, compute the image under coeff^. of all monomials 
of / of degree di. Finally, concatenate all these vectors into the total coefficient vector of /, which 
we will call coeff(/) and which is of size the number of monomials in n -|- 1 variables of 

(total) degree ranging from 0 to d. In this way, a system fi,..., fm of polynomials in n variables 
of degree at most d can be represented by its coefficient matrix of size x m. Each column 

of this matrix corresponds to the vector produced by the map coeff^ above. This map allows us to 
think of polynomials as points in a linear affine space, where Helly’s theorem applies. 

We utilize this construction to import Clarkson’s method m for solving linear problems to al¬ 
gebraic geometry and, in particular, we make use of Helly-type theorems for varieties. Helly-type 
theorems allow one to reduce the problem of solving the system to repeated solution of smaller 
subsystems, whose size is a Kelly number of intersecting linear spaces. As a result, our algorithms 
achieve expected linear runtime in the number of input equations. 

Theorem 1.1. Let F = {fi,..., fm} GiLi be a system of polynomials, and let 5 be the dimension of 
the vector subspace generated by the coefficient vectors of the fi’s, as described above. 

Then there exists a sampling algorithm that outputs F' = j ■ ■ • j /i^} G F such thatV{F) = V{F') 
in an expected number O [dm + of calls to the primitive query that solves a small radical ideal 

membership problem. F and F' generate the same ideal up to radicals. 

It is important to point out that our sampling algorithm will find a small subsystem of the input 
system that, when solved with whatever tools one has at their disposal, will give the same solution 
set as the original (input) system. Here by ‘small’ we mean a system of size 6, where 6 is polynomially 
bounded or constant when the number of variables is constant or when the degree d is small. 

That the rank 5 of the coefficient matrix of the system of polynomials gives the combinatorial 
dimension for this problem is shown in Theorem 14.111 There are several interesting special cases of 
this result. For example, we obtain [211 Corollary 2] as a corollary: if /i,..., /m £ K[xo ,..., x„] are 
homogeneous and of degree at most d each, then the dimension 5 of the vector subspace they generate 
is at most (see Lemma [4.6p . Of course, in many situations in practice, this bound is not sharp, 

as many systems are of low rank. For example, this situation can arise if the monomial support 
of the system is much smaller than the total number of monomials of degree d. In light of this. 
Theorem 14.111 gives a better bound for low-rank systems. Note that we measure system complexity 
by its rank, that is, the vector space dimension 6, and not the usual sparsity considerations such as 
the structure of the monomial support of the system. Further, our result applies to non-homogeneous 
systems as well. Its proof is presented in Section 01 along with the proof of Theorem 11.11 


^ CaX" i-A 
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Computing small generating sets of ideals. The problem of finding “nice” generating sets of 
ideals has numerous applications in statistics, optimization, and other fields of science and engineer¬ 
ing. Current methods of calculating minimal generating sets of ideals with an a priori large number 
of generators are inefficient and rely mostly on Grobner bases computations, since they usually in¬ 
volve ideal membership tests. Of course there are exceptional special cases, such as ideals of points 
in projective space m or binomial systems [T]. Our second main result shows how to efficiently 
extract a small or close to minimal generating set for any ideal from a given large generating set and 
a bound on the size of a minimal generating set. 

Theorem 1.2. Let I = (H) be an ideal generated by a (large) finite set of homogeneous polynomials 
H, and suppose that 'y is a known upper bound for the 0-th total Betti number f3{R/I). 

Then there exists a randomized algorithm that computes a generating set of I of size 7 in expected 
number of 0{'y\H\ -I- 7 "^) calls to the primitive query that solves a small ideal membership problem. 

In particular, if 'j = fi{R/I), the algorithm computes a minimal generating set of I. 

The proof is presented in Section [5l 

2. A Warm-Up: Algebraic Helly-type theorems and the size of a meaningful sample 

A Helly-type theorem has the following form: Given a family of objects F, a property P, and 
a Helly number 5 such that every subfamily of F with 6 elements has property P, then the entire 
family has property P. (See |19[ [23l Wf\ 0].) In the original theorem of E. Helly, E is a finite family 
of convex sets in M"", the constant <5 is n -|- 1, and the property P is to have a non-empty intersection 
|34j . Here we are looking for non-linear algebraic versions of the same concept, where the objects in 
F are algebraic varieties (hypersurfaces) or polynomials; the property desired is to have a common 
point, or to generate the same ideal; and the Helly constant 5 will be determined from the structure 
of the problem at hand. To better understand the algorithms that we present, it is instructive to 
consider two intuitive easy examples that highlight the fundamental combinatorial framework. The 
first one is an obvious reformulation of Helly’s theorem for algebraic geometers. 

Example 2.1. Let H = {Li,L 2 , ■ ■ ■ ,Ls} be a family of affine linear subspaces in M”’. Consider the 
case when s is much larger than n. One would like to answer the following question: when do all of 
the linear varieties have a nonempty intersection? It is enough to check whether each subfamily of 
H with n -|- 1 elements has a non-empty intersection, as that would imply, by Helly’s Theorem, that 
H also has a non-empty intersection. Thus, in practice, one can reduce the task of deciding whether 
/ 0 to the collection of smaller queries f^O^lLi.. However, instead of testing all possible 
(„+i) many (n -|- l)-tuples, we may choose to randomly sample multiple (n -|- l)-tuples. Each time 
we sample, we either verify that one more (n-|- l)-tuple has a non-empty intersection thus increasing 
the certainty that the property holds for all (n-|-l)-tuples, or else find a counterexample, a subfamily 
without a common point, the existence of which trivially implies that = 0. This simple idea 

is the foundation of a randomized approach. Eor now we ask the reader to observe that n -|- 1 is the 
dimension of the vector space of (non-homogeneous) linear polynomials in n variables. 

Example 2.2. The next example is just slightly more complicated, but illustrates well some key 
concepts. Gonsider next H = {/i(xi, X 2 ), f 2 ixi,X 2 ), ■ ■ ■ , /s(xi, X 2 )}, a large family of affine real plane 
curves of degree at most d. Imagine that H is huge, with millions of constraints fi, but the curves 
are of small degree, say d = 2. Nevertheless, suppose that we are in charge of deciding whether the 
curves in H have a common real point. Clearly, if the pair of polynomials f,g^H intersect, they 
do so in finitely many points, and, in particular, Bezout’s theorem guarantees that no more than d? 
intersections occur. One can observe that if the system H has a solution, it must pass through some 
of the (at most d?) points defined by the pair f,g alone. In fact, if we take triples f,g,h £ H, the 
same bound of d^ holds, as well as the fact that the solutions for the entire H must also be part of 
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the solutions for the triplet f,g, h. Same conclusions hold for quadruples, quintuples, and in general 
(5-tuples. But how large does an integer <5 have to be in order to function as a Helly number? We 
seek a number 6 such that if all 5-tuples of plane curves in H intersect, then all of the curves in H 
must intersect. The reader can easily find examples where 6 = d does not work, e.g., for d>2. 

To answer the question posed in Example 12.21 we refer to Theorem 14.111 in Section 01 Without 
re-stating the theorem here, we state the following Corollary and note that it gives a nice bound on 
6. Corollary 12.31 is implied by the observation that there are only monomials in two variables 

of degree < d (which says they span a linear subspace of that dimension inside the vector space of 
all polynomials) and Theorem 14.111 

Corollary 2.3. Let H = {/i(x, y),/ 2 (x, y),... ,/^(x, y)} he a family of affine real plane curves of 
degree at most d. If every 5 = of the curves have a real intersection point, then all the curves 

in H have a real intersection point. If we consider the same problem over the complex numbers, then 
the same bound holds. 

Thus, it suffices to check all 5-tuples of curves for a common real point of intersection, and if all of 
those instances do intersect, then we are sure all \II\ polynomials must have a common intersection 
point, too. The result suggests a brute-force process to verify real feasibility of the system, which 
of course is not a pretty proposition, given that \H\ is assumed to be very large. Instead, Section [3] 
explains how to sample the set of 5-tuples in order to obtain a solution to the problem more efficiently. 
Notice that it is important to find a small Helly number 5, as a way to find the smallest sampling 
size necessary to detect common intersections. It turns out that in this example and in the case 
when all /* are homogeneous, the Helly number is best possible |28j . 

3. Violator spaces and Clarkson’s sampling algorithms 

The key observation in the previous section was that the existence of Helly-type theorems indicates 
that there is a natural notion of sampling size to test for a property of varieties. Our goal is to import 
to computational algebra an efficient randomized sampling algorithm by Clarkson. To import this 
algorithm, we use the notion of violator spaces which we outline in the remainder of this section. We 
illustrate the definitions using Example 12.21 as a running example in this section. 

In 1992, Sharir and Welzl 01] identified special kinds of geometric optimization problems that 
lend themselves to solution via repeated sampling of smaller subproblems: they called these LP-type 
problems. Over the years, many other problems were identified as LP-type problems and several 
abstractions and methods were proposed [2l[3l[IIl|33l[37]. A powerful sampling scheme, devised by 
Clarkson m for linear programming, works particularly well for geometric optimization problems in 
small number of variables. Examples of applications include convex and linear programming, integer 
linear programming, the problem of computing the minimum-volume ball or ellipsoid enclosing a 
given point set in M”', and the problem of finding the distance of two convex polytopes in M”. In 
2008, Gartner, Matousek, Riist and Skovroh [3l] invented violator spaces and showed they give a 
much more general framework to work with LP-type problems. In fact, violator spaces include all 

prior abstractions and were proven in 02] to be the most general framework in which Clarkson’s 

sampling converges to a solution. Let us begin with the key definition of a violator space. 

Definition 3.1 ( |31j ). A violator space is a pair (H, V), where H is a finite set and V a mapping 
2^ —>■ 2^, such that the following two axioms hold: 

Consistency: G n V(G) = 0 holds for all G C H, and 

Locality: V(G) = V(E) holds for all E C G C H such that G n V(E) = 0. 

Example 3.2 (Example 12.21 continued). To illustrate our dehnition, we consider Example 12.21 of s 
real plane curves {fi,..., fs} = H■ 
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A violator operator for testing the existence of a real point of intersection of a subset F <Z H oi the 
curves should capture the real intersection property. One possible way to define it is the following 
map \/real : 2 ^ —)• 2 ^: 

yreal{F) = {h & H : Vr(F) D Vm(F U {h})} , 

where Vr(F) is the set of common real intersection points of F, in other words, the real algebraic 
variety of F. Note that, by definition, VreaiiF) = 0 if the curves in F have no real points of intersec¬ 
tion. Before explaining why Vreai captures the real intersection property correctly (see Example l3.6p . 
let us show that the set {H,\/reai) is a violator space according to Definition 13.11 

Consistency holds by definition of for any h G F, Vk(F) = Vk(EU{/i}), and so h 0 yreai{F)- 

To show locality, we begin by showing the auxiliary fact that Vr(F) = Vr(G). The inclusion 
Vr(G) C Vr(F) is direct. This is because for any S' F S it is always the case that Vr(S'') D Vr(5). 
We need only show Vr(F) C Vr(G). Recall F F G F H and GF\/reai{F) = 0- Consider g G G. 
By the assumption, g does not violate F, and the proper containment Vr(F) F Vr(FU { 5 }) does 
not hold. Therefore the equality Vr(F) = Vr(F U { 5 }) must hold. By iteratively adding elements 
of G \ F to F and repeating the argument, we conclude that indeed Vr(F) = Vr(G). 

Finally, we argue that Vr(F) = Vr(G) implies \/reaiiF) = '^reaiiG)- We first show yreai(F) F 
VreaiiG). Take h G VreaiiF). Then Vr(G) = Vr(F) D Vr(F U {/i}) = Vr(F) n Vu{{h}) = 
VR(G)nVR({h}) = Vr(GU{/i}). The last and third-to-last equalities follow from the fact that for any 
two sets Si, S 2 , one always has Vr( 5 'iUS' 2 ) = VR(Si)nVR(S' 2 ). The containment \/reai{F) D yreai{G) 
follows in a similar argument. Thus {F[, Vreai) is a violator space. 

Every violator space comes equipped with three important components: a notion of basis, its 
combinatorial dimension, and a primitive test procedure. We begin with the definition of a basis of 
a violator space, analogous to the definition of a basis of a linear programming problem: a minimal 
set of constraints that defines a solution space. 

Definition 3.3 f [3T| Definition 7]). Consider a violator space (F, V). B F H is a basis if FnV(F) 7 ^ 
0 holds for all proper subsets F F B. For G F H, a basis of G is a minimal subset F of G with 
V(F)=V(G). 

It is very important to note that a violator operator can capture algebraic problems of interest 
as long as the basis for that violator space corresponds to a basis of the algebraic object we study. 
Violator space bases come with a natural combinatorial invariant, related to Belly numbers we 
discussed earlier. 

Definition 3.4 f [3Tl Definition 19]). The size of a largest basis of a violator space {H,V) is called 
the combinatorial dimension of the violator space and denoted by 5 = 5{H, V). 

A crucial property was proved in m- knowing the violations V(G) for all G C F is enough to 
compute the largest bases. To do so, one can utilize Clarkson’s randomized algorithm to compute 
a basis of a violator space (F, V) with m = \H\. The results about the runtime and the size of the 
sets involved are summarized below. The primitive operation, used as black box in all stages of the 
algorithm, is the violation test primitive. 

Definition 3.5. Given a violator space (F, V), some set G F H, and some element h G H \ G, the 
primitive test decides whether h G V(G). 

The running example illustrates these three key ingredients. 

Example 3.6 (Example l3.2[ continued). In the example of s real plane curves, the violator operator 
we dehned detects whether the polynomials have a real point of intersection. Note that a basis would 
be a (minimal) set of curves B = {/q,...,/i^}, for some 6 < s, such that either the curves in B 
have no real point of intersection, or the real points of intersection of the curves in B are the real 
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intersection of all oi H = {/i,..., /*}. If the set F has no real intersection point, then \/reai{F) = 0 by 
definition, so that set F could be a basis in the sense that it is a certihcate of infeasibility for this real- 
intersection problem. If, on the other hand, F does have a real intersection point, and \lreai{F) = 0; 
then this means that is a basis in the sense that the curves in F capture the intersections of all of 
H. The combinatorial dimension for general H is provided by Corollary 12.31 and it equals 6 = 
However, special structure of the curves in H may imply a smaller combinatorial dimension. 

The primitive query simply checks, given fi ^ H and a candidate subset G F F[, whether the 
set of real points of intersection of G L) {fi} is smaller than the set of real points of intersection of 
the curves in G alone. The role of the primitive query is therefore not to find a basis directly, but 
to check, instead, whether a given candidate subset G can be a basis of H. This can be done by 
checking whether fi G \/reaiiG) for all fi £ F[ \ G. Clearly, given the primitive test, a basis for H can 
be found by simply testing all sets of size at most 5, but that would be a waste because the number 
of times one would need to call the primitive would be 0{\F[f^^). 

As we will see, this brute-force approach can be avoided. Namely, in our current example, the 
randomized algorithm from Theorem 13.71 below will only sample subsets of 5 = curves from the 
set {fi,..., fs}, and find a basis of the violator space of size 6 in the sense explained above. 

The sampling method in m avoids a full brute-force approach. It is presented in two stages, 
referred to as Clarkson’s first and second algorithm. We outline these below. 

Clarkson’s first algorithm, in the first iteration, draws a small random sample R F G, calls the 
second stage to calculate the basis C of R, and returns G if it is already a basis for the larger subset 
G. If G is not already a basis, but the elements of G \ C violating R are few, it adds those elements 
to a growing set of violators W, and repeats the process with G being calculated as the basis of the 
set W F R for a new randomly chosen small R F G \ W. The crucial point here is that \R\ is much 
smaller than |G| and, consequently, it acts as a Helly number of sorts. 

Clarkson’s second algorithm (Basis2) iteratively picks a random small (6(5^ elements) subset R of 
G, finds a basis G for R by exhaustively testing each possible subset (BruteForce;) taking advantage 
of the fact that the sample R is very small, and then calculates the violators of G \ G. At each 
iteration, elements that appear in bases with small violator sets get a higher probability of being 
selected. 

This idea is very important: we are biasing the sampling process, so that some constraints will 
be more likely to be chosen. This is accomplished by considering every element h of the set G as 
having a multiplicity tn(/i); the multiplicity of a set is the sum of the multiplicities of its elements. 
The process is repeated until a basis of G is found, i.e. until V(G \ G) is empty. 

Again, as described above, all one needs is to be able to answer the Primitive query: Given 
G F H and h G H\G, decide whether h G V{G). The runtime is given in terms of the combinatorial 
dimension 5{F[,V) and the size of H. The key result we will use in the rest of the paper concerns 
the complexity of finding a basis: 

Theorem 3.7. |31l Theorem 27] Using Clarkson’s algorithms, a basis of H of a violator space {H,\/) 
can be found by answering the primitive query an expected O [6 \ H\ + times. 

It is very important to note that, in both stages of Clarkson’s method, the query h G V(G) 
is answered via calls to the primitive as a black box. In our algebraic applications, the primitive 
computation requires solving a small-size subsystem (e.g., via Grobner bases or numerical algebraic 
geometry methods), or an ideal membership query applied to the ideal generated by a small subset 
of the given polynomials. On the other hand, the combinatorial dimension relates to the Helly 
number of the problem which is usually a number that is problem-dependent and requires non-trivial 
mathematical results. 
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Algorithm 1: Clarkson’s first algorithm 


input : G C H, 6: combinatorial complexity of H 
output: B, a basis for G 


1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 


if |G| < 95^ then 
I return Basis2(G) 
else 

VC ^ 0 

repeat 

R •(— random subset of G \ VC with [5 y^|G|J elements. 
G ^ Basis2(VC U R) 

V ^ {he G\C s.t. hG V(G)} 
if |B| < 2v^ then 
I W^WUV 
end 

until y = 0 
end 

return G. 


Algorithm 2: Clarkson’s second algorithm: Basis2(G) 
input : G 'e H] 6: combinatorial complexity of H. 
output: B: a basis of G 


1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 


if |G| < 65^ then 
I return BruteForce(G) 
else 

repeat 

R e- random subset of G with 65^ elements. 
G ^ BruteForce(ii) 

{/i G G\G s.t. hG V(G)} 
if m(F) < m(G)/3(5 then 
for h eV do 
I m(/i) ^ 2m{h) 
end 
end 

until F = 0 
end 

return G. 


In the two sections that follow we show how violator spaces naturally arise in non-linear algebra 
of polynomials. 


4. A VIOLATOR SPACE FOR SOLVING OVERDETERMINED SYSTEMS 

We discuss our random sampling approach to solve large-size (non-linear) polynomial systems by 
applying Clarkson’s algorithm. In particular, we prove Theorem ll.il as a corollary of Theorem 14.111 
This result is motivated by, and extends, Helly-type theorems for varieties from m and [28], which 
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we use to show that the above algorithms apply to large dense homogeneous systems as well (Corol¬ 
lary I4.7P . 

First, we define a violator space that captures (in the sense explained in the previous section) 
solvability of a polynomial system. 

Definition 4.1. [Violator Space for solvability of polynomial systems] Let S' C if be finite subsets 
of polynomials in 91. Define the violator operator Vgoive : 2^ —)• 2^ to record the set of polynomials 
in H which do not vanish on the variety V(S). Formally, 

\/soive{S) = {/ € Lf : V(S) is not contained in V(/)}. 

Lemma 4.2. The pair {H,\/solve) ® violator space. 

Proof. Note that \/soiveiS) n S = 0 by definition of \/soiveiS), and thus the operator satisfies the 
consistency axiom. 

To show locality, suppose that F T G C H and G n \/soiveiF) = 0. Since F C G we know that 
V{G) C V{F). On the other hand, by definition, G n \/soiveiF) = 0 implies that V(F) C V{g) for all 
g G G. Thus V{F) is contained in flgec ^(S') ~ But then the two varieties are equal. 

To complete the argument we show that V{F) = V{G) implies \/soive{F) = VgoiveiG)- If h G 
'^soiveiF) then V{h) cannot contain V{F) = V{G), thus h G ^soive{G) too. The argument is sym¬ 
metric, hence \/soive{F) = VsoiveiG). □ 

It follows from the definition that the operator y solve gives rise to a violator space for which a 
basis B oi G <Z H a. set of polynomials such that V{B) = V(G). Therefore, a basis B <Z G will 
either be a subset of polynomials that has no solution and as such be a certificate of infeasibility of 
the whole system G, or it will provide a set of polynomials that are sufficient to find all common 
solutions of G, i.e., the variety V(G). 

Next, we need a violation primitive test that decides whether h G ^soive{F)j in Definition 13.51 
By the definition above, this is equivalent to asking whether h vanishes on all irreducible components 
of the algebraic variety V{F). As is well known, the points of V{F) where the polynomial h does 
not vanish correspond to the variety associated with the saturation ideal ((F) : h°°). Thus, we may 
use ideal saturations for the violation primitive. For completeness, we recall the following standard 
definitions. The saturation of the ideal (F) with respect to /, denoted by ((F) : f°°), is defined to 
be the ideal of polynomials <7 G 91 with f"^g G I for some m > 0. This operation removes from 
the variety V(F) the irreducible components on which the polynomial / vanishes. Recall that every 
variety can be decomposed into irreducible components (cf. |181 Section 4.6] for example). The 
corresponding algebraic operation is the primary decomposition of the ideal defining this variety. 

Lemma 4.3 (e.g. [5l Chapter 4]). Let be a minimal primary decomposition for the ideal I. 

The saturation ideal (/ : /°°) equals 

Proof. It is known that : /°°) = DfLjiQi : /°°). We observe further that {Qi : /°°) = Qi if 

/ does not belong to \fQi and (Qi : /°°) = (1) otherwise. □ 

This allows us to set up the primitive query for {P[,\/solve)- However we do not need to calculate 
the decomposition explicitly, but can instead carry it out using elimination ideals via Grobner bases, 
as explained for example in |18l Exercise 4.4.9]. 

Observation 4.4. The primitive query for (H, \/ solve) is simply the saturation test explained above. 

Remark 4.5. There is an obvious reformulation of these two ingredients that is worth stating 
explicitly. Namely, since a basis B for the violator space {H, \l solve) is a set of polynomials such 
that V{B) = V{H), the strong Nullstellensatz implies that = ^/JW). Thus a basis deter¬ 

mines the ideal of the input system up to radicals, and we could have named the violator operator 
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^solve = ^radical instead. Furthermore, a polynomial h vanishing on all irreducible components of 
the algebraic variety V(F) is equivalent to h G y/{F), i.e., h belonging to the radical of the ideal 
(F). In particular, the primitive query for Msolve can also be stated as the radical ideal membership 
test. This test can be implemented using Grobner bases, as explained for example in |18l Proposition 
4.2.8]: h € [F) if and only if 1 G (F, 1 — yh) C K[xq, ... ,Xn,y]. Therefore, computation of one 

Grobner basis of the ideal (F, 1 — yh) suffices to carry out this test. 

Finally, we solve the problem of finding a combinatorial dimension for \/solve- For this, consider, 
as a warm up, the simpler situation where we have a Helly-type theorem for hypersurfaces defined 
by homogeneous polynomials. This was proved by Motzkin |39j and then later reproved by Deza 
and Frankl |21) . and it provides us with a combinatorial dimension for guaranteeing that a large- 
scale homogeneous system has a solution. Its proof relies on thinking of the polynomial ring as a 
iF-vector space (see also the discussion before Definition 14.9p . 

Lemma 4.6 ([21], Gorollary 2). Let /i,...,/^ C 91 he a system of homogeneous polynomials, 
that is, fi G [91]^., and define d = max{dj}. Suppose that every subset of p = polynomials 

{/j^, ■ ■ ■, fip} C {/i,..., fm} has a solution. Then the entire system {/i,..., fm} does as well. 

Lemma 14.61 provides the combinatorial dimension that, along with the variety membership primi¬ 
tive from Observation 14.41 allows us to apply Glarkson’s algorithms to the violator space {H,\/solve)- 

Corollary 4.7. Let (/i,..., fm) CiR be an ideal generated by m homogeneous polynomials in n + 1 
variables of degree at most d; fi G [91]^. and d = max{dj}. Let d = Then there is an 

adaptation of Clarkson’s sampling algorithm that, in an expected O [dm + d'^^^^) number of calls to 
the primitive query\^ computes {/q,..., /jj such that V(/i, - - -, fm) = V{fh,- - -, fig)- 

In particular, this algorithm is linear in the number of input equations m, and a randomized 
polynomial time algorithm when the number of variables re -|- 1 and the largest degree d are fixed. 
Furthermore, we can extend it to actually solve a large system: once a basis B = {/q ,..., fig} for the 
space ({/i,..., fm}, solve) is found, then we can use any computer algebra software (e.g. (6] [32l 146] 1 
to solve fi^ = -- - = fig = 0. 

Note that Lemma 14.61 can be thought of as a statement about the complexity of Hilbert’s Null- 
stellensatz. If (/i,..., fm) = 94 (i.e., V(/i,..., fm) = 0), then there exists a subset of size d = 
polynomials {fii, - - -, fig} such that V(/q,...,/i^) = 0 as well. In particular, there is a Nullstel- 
lensatz certificate with that many elements. The dimension is, in fact, only an upper bound, 

attainable only by dense systems. However, in practice, many systems are very large but sparse, and 
possibly non-homogeneous. Let us highlight again that the notion of ‘sparsity’ we consider is cap¬ 
tured by a low-rank property of the system of polynomial equations, made explicit below in terms of 
the coefficient matrix. This is crucially different from the usual considerations of monomial supports 
(Newton polytopes) of the system; instead, we look at the coefficients of the input polynomials - that 
is, we linearize the problem and consider the related vector spaces, as illustrated in the following 
example. 

Example 4.8. Consider the following system consisting of two types of polynomials: polynomials of 
the form xf — 1 for i = 1 ,..., re, and polynomials of the form Xi + Xj for the pairs {i, j : i ^ j mod 2} 
along with the additional pair z = 1, j = 3. This system has m = + n + 1 equations, and the 

interesting situation is when the number of variables is a large even number, that is, n = 2k for any 
large integer k. This system of polynomials generates the 2-coloring ideal of a particular re-vertex 
non-chordal graph. (See pH] and references therein for our motivation to consider this particular 
system.) 
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Consider a concrete graph. Take the ra-cycle with all possible even chords, and one extra edge 
{1, 3}. Thus the pairs {i,j} are indexed by edges of the graph G on n nodes where all odd-numbered 
vertices are connected to all even-numbered vertices, and with one additional edge {1,3}. 

We wish to decide if the system has a solution, but since there are -|- n -|- 1 many polynomials, 
we would like to try to avoid computing a Grobner basis of this ideal. Instead, we search for a 
subsystem of some specific size that determines the same variety. It turns out that the system 
actually has no solution. Indeed, a certificate for infeasibility is a random subsystem consisting of 
the first n quadratic equations, n — 1 of the edge equations Xi -|- Xj with {i,j '■ i ^ j mod 2 }, and 
the additional equation xi -|- X 3 . For example, the first n — 1 edge polynomials will do to construct 
a 2n-sized certificate of this form. Why is the number n -|- (n — 1) -|- 1 = 2n so special? 

To answer this question, let us linearize the problem: to each of the polynomials / associate 
a coefficient (column) vector coeff(/) G whose coordinates are indexed by the monomials 

appearing in the system xf,... ,x‘^,l,xi,... ,Xn- Putting all these column vectors in one matrix 
produces the coefficient matrix of the system of the form 

■ In O' 

-1 0 , 

0 E 

where In is the n x n identity, —1 is the row vector with all entries —1, and E is the vertex-edge 
incidence matrix of the graph G. Since it is known that the rank of an edge-incidence matrix of an 
n-vertex connected graph is n — 1 , the rank of this matrix is 5 = n -|- (n — 1 ) -|- 1 = 2 n. 

Remarkably, the magic size of the infeasibility certificate equals the rank of this coefficient matrix. 

This motivating example suggests that the desired Helly-type number of this problem is captured 
by a natural low-rank property of the system. To define it precisely, let us revisit the extension of 
the Veronese embedding to non-homogeneous polynomials explained in the Introduction. Here we 
adopt the notation from [71 Section 2] and consider polynomials in of degree up to d as a RT-vector 
space denoted by The vector space has dimension which, of course, equals the 

number of monomials in n -|- 1 variables of (total) degree from 0 to d. In this way, any polynomial 
/ G is represented by a (column) vector, coeff(/) G whose entries are the coefficients of /. 

Thus, any system S' C dehnes a matrix with IS"! columns, each of which is an element of 

Definition 4.9. A system S' C 91 is said to have rank D if dim;^(5) = D, where {S) is the vector 
subspace of generated by the coefficients of the polynomials in S. 

We need to also make the notion of Helly-type theorems more precise in the setting of varieties. 

Definition 4.10 (Adapted from Definition 1.1. in [28]). A set S' C 91 is said to have the D-Helly prop¬ 
erty if for every nonempty subset Sq C S, one can find pi ,... ,pD ^ Sq with V(So) = V(pi,... ,Pd)- 

The following result, which implies Theorem 11.11 is an extension of [28] to non-homogeneous 
systems. It also implies (the contrapositive of) Lemma 14.61 when restricted to homogeneous systems, 
in the case when the system has no solution. The proof follows that of [28], although we remove the 
homogeneity assumption. We include it here for completeness. 

Theorem 4.11. Any polynomial system S' C 91 0 / rank D has the D-Helly property. 

In other words, for all subsets V <Z S, there exist pi,... ,p£) gV such that V{V) = V{pi,... ,Pd)- 

Proof. Let P C S be an arbitrary subset of polynomials, and denote by (V) C the vector 

subspace it generates. Let do = dimi^(P). We need to find polynomials pi,...,pD such that 
V{pi,... ,Pd) = V(P). Note that do < D, of course, so it is sufficient to consider the case V = S. 

Choose a vector space basis (pi,... ,p_d) = (7*) = (S). It suffices to show V(pi,... ,Pd) fL V(S); 
indeed, the inclusions V(S) C V{V) C V(pi,... ,Pd) already hold. 
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Suppose, on the contrary, there exists x = (xq, ■ ■ • ,x„) G and p £ S such that p{x) 7 ^ 0 but 

Pi{x) = 0 for all i = 1,..., D. Since piS generate 5 as a vector space, there exist constants 'ji £ K 
with p = implying that p{x) = 0 , a contradiction. □ 

The proof above is constructive: to find a subset pi,... ,pD £ Sq, one only needs to compute a 
vector space basis for (Sq). Thus, linear algebra (i.e., Gaussian elimination) can construct this subset 
in time Od^op). The sampling algorithm based on violator spaces is more efficient. 

Proof of Theorem M.l[ From Lemma 14.21 we know that {{fi, ■ ■ ■, fm},'^solve) is a violator space. 
Theorem 14.111 shows that it has a combinatorial dimension, and Observation 14.41 shows that there 
exists a way to answer the primitive test. Having these ingredients, Theorem 13.71 holds and it is 
possible for us to apply Clarkson’s Algorithm again. □ 

Remark 14.51 provides the following interpretation of Theorem 14.111 

Corollary 4.12. Let I = (/i,..., /m) C 91 and let D = dimx(/i, • • •, fm)- Then, for all subsets V 
of the generators fi,..., fm, there exist pi,... ,pD £ V sueh that y/P = ^y{pl,... ,pd)- 

5. A VIOLATOR SPACE FOR FINDING GENERATING SETS OF SMALL CARDINALITY 

In this section, we apply the violator space approach to obtain a version of Clarkson’s algorithm 
for calculating small generating sets of general homogeneous ideals as defined on page 3. As in 
Section 01 this task rests upon three ingredients: the appropriate violator operator, understanding 
the combinatorial dimension for this problem, and a suitable primitive query which we will use as a 
black box. As before, fixing the definition of the violator operator induces the meaning of the word 
‘basis’, as well as the construction of the black-box primitive. 

To determine the natural violator space for the ideal generation problem, let I C 91 be a homoge¬ 
neous ideal, H some initial generating set of I, and define the operator \/SmaiiGen as follows. 

Definition 5.1. [Violator Space for Homogeneous Ideal Generators] Let S <£ H he finite subsets of 
91. We define the operator ysmaiiGen : 2^ —)• 2^ to record the set of polynomials in H that are not 
in the ideal generated by the polynomials in S. Formally, 

ySmallGeniS) = {f £ H : {S, f) D (5)}. 

Equivalently, the operator can be viewed as ySmaiiGen{S) = {f £ H : f ^ (S)}- 
Lemma 5.2. The pair {H,ysmaiiGen) is a violator space. 

Proof. Note that ysmaiiGeniS) n 5 = 0 by definition of ySmaiiGen{S), and thus the operator satisfies 
the consistency axiom. 

To show locality, suppose that F <£ G C H and G n ysmaUGen{F) = 0- Since F C G, (F) C (G). 
On the other hand GfiysmaiiGen{T) = 0 implies that G C (T) which in turn implies that (G) C (T). 
Then the ideals are equal. Then, because (G) = {F) we can prove that ysmaUGen{T) = ySmaiiGen{G)■ 
Note first that (G) = (F), holds if and only if (G, h) = (F, h) for all polynomials h £ H. 

Finally, to show ySmaiiGen{F) = ySmaiiGeniG), we note that h £ ysmaiiGen{F) if and only if 
{G,h) = {F,h) (F) = (G), and this chain of equations and containment holds if and only if 

h £ ySmaiiGen{G). Therefore, locality holds as well and ysmaiiGen is a violator space operator. □ 

It is clear from the definition that {Ff,ysmaiiGen) is a violator space for which the basis of G C F 
is a minimal generating set of the ideal (G). 

The next ingredient in this problem is the combinatorial dimension: the size of the largest minimal 
generating set. This natural combinatorial dimension already exists in commutative algebra, namely, 
it equals a certain Betti number. (Recall that Betti numbers are the ranks Pij of modules in the 
minimal (graded) free resolution of the ring 91//; see, for example, [24( Section IB (pages 5-9)].) 
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Specifically, the number /3oj' is defined as the number of elements of degree j required among any set 
of minimal generators of I. The (0-th) total Betti number of IH//, which we will denote by /3(1H//), 
simply equals /3oj, and is then the total number of minimal generators of the ideal I. It is well 
known that while I has many generating sets, every minimal generating set has the same cardinality, 
namely /3(1H//). In conclusion, it is known that 

Observation 5.3. The combinatorial dimension for {H,\/s^allGen) is the (0-th) total Betti number 
of the ideal I = (H)] in symbols, (3{y\./I) = 5{H,\/ smaiiGen)- 

Although it may be difficult to exactly compute I3{y{/I) in general, a natural upper bound for 
j3{d\/I) is the Betti number for any of its initial ideals (the standard inequality holds by upper- 
semicontinuity; see e.g. [38( Theorem 8.29]). In particular, if H is known to contain a Grobner 
basis with respect to some monomial order, then the combinatorial dimension can be estimated by 
computing the minimal generators of an initial ideal of (H), which is a monomial ideal problem and 
therefore easy. In general, however, we only need j3(y{/I) < \H\ for the proposed algorithms to be 
efficient. 

The last necessary ingredient is the primitive query for MSmaiiGen- 

Observation 5.4. The primitive query for MsmaiiGen, deciding if h e VSmaiiGen{G) given h e H 
and G d H, is an ideal membership test. 

Of course, the answer to the query is usually Grobner-based, but, as before, the size of the 
subsystems G on which we call the primitive query is small: (0(5^)). In fact, it is easy to see that 
many small Grobner computations for ideal membership cost less than the state-of-the-art, which 
includes at least one large Grobner computation. 

Proof of Theorem \1.2[ From Lemma (5.21 we know {H,\/smaUGen) is a violator space and we have 
shown it has a combinatorial dimension and a way to answer the primitive test. Having these 
ingredients. Theorem 13.71 holds and it is possible for us to apply Clarkson’s Algorithm. □ 

Remark 5.5. Intuitively, the standard algorithm for finding minimal generators needs to at least 
compute a Grobner basis for an ideal generated by \H\ polynomials, and in fact it is much worse 
than that. One can simplify this by skipping the computation of useless S-pairs (e.g. as in [30]), but 
improvement is not by an order of magnitude, overall. The algorithm remains doubly exponential in 
the size of H for general input. In contrast, our randomized algorithm distributes the computation 
into many small Grobner basis calculations, where “many” means no more than 0{f3\H\ -1-/3^), and 
“small” means the ideal is generated by only 0{fP‘) polynomials. 

To conclude, in a forthcoming paper we will study further the structure of our violator spaces 
and discuss the use of more LP-type methods for the same algorithmic problems. We also intend to 
present some experimental results for the sampling techniques we discussed here. We expect better 
performance of the randomized methods versus the traditional deterministic algorithms. 
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